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RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: Q9/G3(j : '2¥2 

Source: l^pO > ^ 

Date Processed by STIC: 2/&/Zoo2~ * 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21heIp@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3he!p(5) / u$pto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
h ttp :/Avww. u spto. go v/web/of fices/p ac/ch ecker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://ymw.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - eP AVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel "Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 
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Raw Sequence listing Error Siimnwr 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

. 1 Wrapped Nuddc* 

Wrapped Amine* 



SUCCESTED CORRECTION SERIAL NUMBER;^ 

PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The numbcrAcxt al Ihe end of etch lino "wrapped" down to Ihe next line. This may occur if your file 
was retrieved In a word processor ailer creating H. Please adjust your right margin to J; this will 
prevent "wrapping." - * 



! Invalid Line Length The rulei require Out a line not exceed 72 character! in length. Thii include! white spaces. 



_Misaligned Amino 
Numbering 

Non- ASCII 



Variable Length 



The numbering under each 3* amino add it misaligned. Do not use Ub codes between numbers; 
use space characters, instead 

The submitted Hie wis not saved in ASCUfDOS) text, as required by the Sequence Rules. Please ~ 
ensure your subsequent submission b wved In ASCII text 



our subsequent sub: 

i,T . . 

f$) / contain n i c 



_PatenlIn 2.0 
"bug" 



_Skipped Sequence* 
(OLD RULES) 



_S kipped Sequences 
(NEW RULES) 



Sequencers) / ' contain n*i or Xaa's representing more than one residue. Per Sequence Rule*, 
each n or Xu can only represent a single residue. Please present Ihe maximum number of each 
residue having variable length and indicate in Ihe <220>-<223> seetioxvlhat some may be missing. 

A "bug" in Palenlln version 2.0 has caused the <220>-<223> section lo be missing from amino acid 

sequencers) . Normally. Patenlln would automatically generate this section from the 

previously ceded nucleic add sequence. Please manually copy Ihe relevant <220>-<223> section to 
the subsequent amino add sequence. .This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences.' 

Sequence^) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(t) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTIONS EQ ID NO:X: (insert SEQ ID NO where "X",ij shown) 
This sequence is intentionally skipped 

PleasCtUo adjust the "(ii) NUMBER OF SEQUENCES:" response lo Include the skipped sequences. 

Sequences) missing. If Intentional, please insert the following lines for each skipped sequence. 

<2 10> sequence id number 
<400> sequence id number 
000 



_Use of n's or XaVs Use of n's and/or Xaa's have been detected in the Sequence Listing. 
(NEW RULES) 



_Inv*lid<2!3> 
Response 



12 Paten tin 2.0 

"bug" 



Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY" if n's or Xaa's are present 
In <220> lo <223> section, please explain location of n or Xu, and which residue n or Xa* represents. 

Per 1.803 of Sequence i^uTei'^he only valid <2!3> responses are: Unknown, Artificial Sequence, or 
sdentific name (Oenus/spedcs); <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence ■ ■■ f 



Sequence(s)_ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> lo <223> is MANDATORY if <213> "Organism" response ii "Aru'fidal Sequence" or 
"Unknown." Please explain source of genetic material in <220> lo <223> sectioa 
(See -Federal Register," 06/01/1998. VoL 63. No, 1CH, pp. 29631-32) (Sec 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Palenlln version 2.0. This causes a corrupted Hie, 
resulting in missing mandatory numeric identifier* and responses (as indicated on raw sequence 
luting). Instead, please use "File Manager" or any other manual means to copy file lo floppy disk. 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence N is not used to represent 

any value not specifically a nudeotidc, 

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/636,24 3 



DATE: 02/21/2002 
TIME: 11:29:22 



Input Set : 
Output Set: 



A:\seqlist.txt 
N:\CRF3\02212002\l636243.raw 



3 
4 
6 
8 
10 
11 
13 
14 
16 
18 
20 
21 
22 
23 
25 
26 
27 
29 
30 
31 
32 
34 
35 
36 
37 
39 
40 
41 
42 
44 
45 
46 
47 
49 
50 
51 
52 
54 
> 55 
56 

W--> 58| 

62 
63 



W 



where 2-4 Xaa's are presei 



<110> APPLICANT: WANG , Bryan S. 

PABO, Carl O. 
<120> TITLE OF INVENTION: DIMERIZING PEPTIDES 
<130> FILE REFERENCE: 8325-1004 / M4-US1 
<140> CURRENT APPLICATION NUMBER: 09/636,243 
<141> CURRENT FILING DATE: 2000-08-10 
<150> PRIOR APPLICATION NUMBER: 60/148,422 
<151> PRIOR FILING DATE: 1999-08-11 
<160> NUMBER OF SEQ ID NOS : 83 
<170> SOFTWARE: Patentln Ver. 2.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence 

motif characterizing C2H2 class proteins 
<220> FEATURE: 
<221> NAME/KEY: REPEAT 
<222> LOCATION: (2) 
<223> OTHER INFORMATION 
<220> FEATURE: 
<221> NAME/KEY: REPEAT 
<222> LOCATION: (17) 
<223> OTHER INFORMATION 
<220> FEATURE: 
<221> NAME/KEY: SITE 
<222> LOCATION: (2) 
<223> OTHER INFORMATION 
<220> FEATURE: 
<221> NAME/KEY: SITE 

<222> LOCATION: (4).. (15) ^ 

<223> OTHER INFORMATION: where Xaa is any amino acid 
<220> FEATURE: 
<221> NAME/KEY: SITE 
<222> LOCATION: (17) 

<223> OTHER INFORMATION: where Xaa is any amino acid 
<400^SEQUENCE: 1 / 
Cysf Xaa] Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa His 
s-**^/ 5 10 15 

f Xaa J His 

V<2K)> SEQ ID NO: 2 
<211> LENGTH: 4 



Does Not Comply 
Corrected Diskette Needed 



I,s4 



exemplary 



where 3-5 Xaa's are present 



ent^) 



where Xaa is any amino acid 



file://C:\Crf3\Outhold\VsrI636243.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/636,24 3 



DATE : 02/21/2002 
TIME: 11:29:22 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\02212002\I636243.raw 



64 <212> TYPE: PRT 

65 <213> ORGANISM: Artificial Sequence 

67 <220> FEATURE: 

68 <223> OTHER INFORMATION: Description of Artificial Sequence: D-able 



71 <400> SEQUENCE: 2 

72 Asn Asn Gly Lys 

73 1 

76 <210> SEQ ID NO: 3 

77 <211> LENGTH: 9 

78 <212> TYPE: DNA 

79 <213> ORGANISM: Artificial Sequence 

81 <220> FEATURE: 

82 <223> OTHER INFORMATION: Description of Artificial Sequence: zinc finger 

83 protein bind sequence 

85 <400> SEQUENCE: 3 

86 ggcgtagac 9 

88 <210> SEQ ID NO: 4 

89 <211> LENGTH: 9 

90 <212> TYPE: DNA 

91 <213> ORGANISM: Artificial Sequence 

93 <220> FEATURE: 

94 <223> OTHER INFORMATION: Description of Artificial Sequence: zinc finger 

95 protein bind sequence 

97 <400> SEQUENCE: 4 

98 ggcgacgta 9 

100 <210> SEQ ID NO: 5 

101 <211> LENGTH: 5 

102 <212> TYPE : PRT 

103 <213> ORGANISM: Artificial Sequence 

105 <220> FEATURE: 

106 <223> OTHER INFORMATION: Description of Artificial Sequence: peptide 

107 ' linker 

109 <400> SEQUENCE: 5 

110 Thr Gly Glu Lys Pro 

111 1 5 

114 <210> SEQ ID NO: 6 

115 <211> LENGTH: 5 

116 <212> TYPE: PRT 

117 <213> ORGANISM: Artificial Sequence 

119 <220> FEATURE: 

120 <223> OTHER INFORMATION,: Description of Artificial Sequence: linker 

122 <400> SEQUENCE: 6 

123 Gly Gly Gly Gly Ser 

124 1 5 

127 <210> SEQ ID NO: 7 

128 <211> LENGTH: 8 

129 <212> TYPE: PRT 

130 <213> ORGANISM: Artificial Sequence 



69 



subsite 
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RAW SEQUENCE LISTING DATE: 02/21/2002 

PATENT APPLICATION: US/09/636,243 TIME: 11:29:22 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\02212002\I636243.raw 

132 <220> FEATURE: 

133 <223> OTHER INFORMATION: Description of Artificial Sequence: linker 

135 <400> SEQUENCE: 7 

136 Gly Gly Arg ' Arg Gly Gly Gly Ser 

137 1 5 

140 <210> SEQ ID NO: 8 

141 <211> LENGTH: 9 

142 <212> TYPE: PRT 

143 <213> ORGANISM: Artificial Sequence 

145 <220> FEATURE: 

146 <223> OTHER INFORMATION: Description of Artificial Sequence: linker 

148 <400> SEQUENCE: 8 

149 Leu Arg Gin Arg Asp Gly Glu Arg Pro 

150 1 5 

153 <210> SEQ ID NO: 9 

154 <211> LENGTH: 12 

155 <212> TYPE: PRT 

156 <213> ORGANISM: Artificial Sequence 

158 <220> FEATURE: 

159 <22 3> OTHER INFORMATION: Description of Artificial Sequence: linker 

161 <400> SEQUENCE: 9 

162 Leu Arg Gin Lys Asp Gly Gly Gly Ser Glu Arg Pro 

163 15 10 

166 <210> SEQ ID NO: 10 

167 <211> LENGTH: 16 

168 <212> TYPE: PRT 

169 <213> ORGANISM: Artificial Sequence 

171 <220> FEATURE: 

172 <223> OTHER INFORMATION: Description of Artificial Sequence: linker 

174 <400> SEQUENCE: 10 

175 Leu Arg Gin Lys Asp Gly Gly Gly Ser Gly Gly Gly Ser Glu Arg Pro 

176 15 10 15 

179 <210> SEQ ID NO: 11 

180 <211> LENGTH: 18 

181 <212> TYPE: PRT 

182 <213> ORGANISM : Artificial Sequence 

184 <220> FEATURE: 

185 <223> OTHER INFORMATION: Description of Artificial Sequence: component 

186 finger of zinc finger protein 

188 <220> FEATURE: 

189 <221> NAME/KEY: REPEAT r ' 

190 <222> LOCATION: (2) /iXA 

191 <2 23> OTHER INFORMATION :^where 2-4 Xaa • s are present 

193 <220> FEATURE: 

194 <221> NAME/KEY: REPEAT 

195 <222> LOCATION: (17) 

196 <223> OTHER INFORMATIOi 

198 <220> FEATURE: 

199 <221> NAME/KEY: SITE 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/636,243 



DATE: 02/21/2002 
TIME: 11:29:22 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\02212002\l636243.raw 



200 


<222> 


LOCATION: (2) 














201 


<223> 


OTHER INFORMATION: 


where 


Xaa 


is 


any 


amino 


acid 


203 


<220> 


FEATURE : 














204 


<221> 


NAME/KEY: SITE 














205 


<222> 


LOCATION: (4) . . (15) 












206 


<223> 


OTHER INFORMATION: 


where 


Xaa 


is 


any 


amino 


acid 


208 


<220> 


FEATURE : 














209 


<221> 


NAME/KEY: SITE 














210 


<222> 


LOCATION: (17) 














211 


<223> 


OTHER INFORMATION: 


where 


Xaa 


is 


any 


amino 


acid 


213 


<400>- 


^SEQUENCE: 11 















W--> 214 Cys (Xaa) Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa His 



222 <211> LENGTH: 30 

223 <212> TYPE: PRT 

224 <213> ORGANISM: Artificial Sequence 

226 <220> FEATURE: 

227 <223> OTHER INFORMATION: Description of Artificial Sequence: DNA binding 

228 domain Fl 

230 <400> SEQUENCE: 12 

231 Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser Arg Ser Asp 

232 15 10 15 

234 Glu Leu Thr Arg His lie Arg lie His Thr Gly Gin Lys Pro 

235 20 25 30 

238 <210> SEQ ID NO: 13 

239 <211> LENGTH: 28 

240 <212> TYPE: PRT 

241 <213> ORGANISM: Artificial Sequence 

243 <220> FEATURE: 

244 <223> OTHER INFORMATION: Description of Artificial Sequence: DNA binding 

245 domain F2 

247 <400> SEQUENCE: 13 

248 Phe Gin Cys Arg lie Cys Met Arg Asn Phe Ser Arg Ser Asp His Leu 

249 1 5 10 15 

251 Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro 

252 20 25 

255 <210> SEQ ID NO: 14 

256 <211> LENGTH: 38 

257 <212> TYPE: DNA 

258 <213> ORGANISM: Artificial Sequence 

260 <220> FEATURE: 

261 <223> OTHER INFORMATION : Description of Artificial Sequence: DNA binding 

262 site 

264 <400> SEQUENCE: 14 

265 ggttgcagtg ggcgcgccca cagtacttga acgtaacg 38 

267 <210> SEQ ID NO: 15 

268 <211> LENGTH: 34 




10 



15 
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RAW SEQUENCE LISTING DATE: 02/21/2002 

PATENT APPLICATION: US/09/636, 243 TIME: 11:29:22 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\02212002\l636243.raw 

269 <212> TYPE: DNA 

270 <213> ORGANISM: Artificial Sequence 

272 <220> FEATURE: 

273 <223> OTHER INFORMATION: Description of Artificial Sequence: DNA binding 

274 site 

276 <400> SEQUENCE: 15 

277 cgttacgttc aagtactgtg ggcgcgccca ctgc 34 

279 <210> SEQ ID NO: 16 

280 <211> LENGTH: 12 

281 <212> TYPE: DNA 

282 <213> ORGANISM: Artificial Sequence 

284 <220> FEATURE: 

285 <223> OTHER INFORMATION: Description of Artificial Sequence: DNA binding 

286 site 

288 <400> SEQUENCE: 16 

289 tgggcgtatg ct 12 

291 <210> SEQ ID NO: 17 

292 <211> LENGTH: 12 

293 <212> TYPE: DNA 

294 <213> ORGANISM: Artificial Sequence 

296 <220> FEATURE: 

297 <223> OTHER INFORMATION: Description of Artificial Sequence: DNA binding 

298 site 

300 <400> SEQUENCE: 17 

301 agcatacgcc ca 12 

303 <210> SEQ ID NO: 18 

304 <211> LENGTH: 57 

305 <212> TYPE: DNA 

306 <213> ORGANISM: Artificial Sequence 

308 <220> FEATURE: 

309 <223> OTHER INFORMATION: Description of Artificial Sequence: DNA binding 

310 site 

312 <400> SEQUENCE: 18 

313 ggaattcctg atcaagatct ggtcacgtcc ataggctagg catgtcaagg ctgtatg 57 

315 <210> SEQ ID NO: 19 

316 <211> LENGTH: 57 

317 <212> TYPE: DNA 

318 <213> ORGANISM: Artificial Sequence 

320 <220> FEATURE: 

321 <223> OTHER INFORMATION: Description of Artificial Sequence: DNA binding 

322 site 

324 <400> SEQUENCE: 19 

325 gggatccact cgcgaacgcg tccttgtagt gggcgcgccc acatacagcc ttgacat 57 

327 <210> SEQ ID NO: 20 

328 <211> LENGTH: 12 

329 <212> TYPE: DNA 

330 <213> ORGANISM: Artificial Sequence 

332 <220> FEATURE: 

333 <223> OTHER INFORMATION: Description of Artificial Sequence: inverted 
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VERIFICATION SUMMARY DATE: 02/21/2002 

PATENT APPLICATION: US/09/63 6 f 24 3 TIME: 11:29:23 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\02212002\I636243.raw 

L:55 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 
L:58 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 
L:214 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11 
L: 217 M: 341 W: (46) "n" or "Xaa" used, for SEQ ID#:11 
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